Gradient GPS: Turbocharge Your Diffusion Models with Targeted Tuning
dev.toยท1dยท
Discuss: DEV
๐ŸŽ๏ธTensorRT
Flag this post
[D] Best (free) courses on neural networks
reddit.comยท3hยท
๐Ÿ‘๏ธAttention Optimization
Flag this post
Bayesian continual learning and forgetting in neural networks
nature.comยท2d
๐Ÿ‘๏ธAttention Optimization
Flag this post
Counteracting Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing
arxiv.orgยท1d
๐Ÿ‘๏ธAttention Optimization
Flag this post
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
paperium.netยท23hยท
Discuss: DEV
๐ŸŽ“Model Distillation
Flag this post
My ML Learning Journey: From Confusion to Building a Working Model
kaggle.comยท1dยท
Discuss: DEV
๐Ÿ“‰Model Quantization
Flag this post
Part II : Building My First Large Language Model from Scratch
medium.comยท7hยท
Discuss: DEV
๐Ÿ“‰Model Quantization
Flag this post
Deep Learning โ€” 7 : Optimize your Neural Networks through Dropouts & Regularization.
pub.towardsai.netยท1d
๐ŸงฎcuDNN
Flag this post
Your Transformer is Secretly an EOT Solver
elonlit.comยท1dยท
Discuss: Hacker News
๐Ÿ‘๏ธAttention Optimization
Flag this post
The Role of GPUs in Accelerating Deep Learning Training
acecloud.aiยท2dยท
Discuss: DEV
๐Ÿ”—NCCL
Flag this post
Custom Intelligence: Building AI that matches your business DNA
aws.amazon.comยท1d
๐Ÿค–AI Coding Tools
Flag this post
From Data to Rewards: a Bilevel Optimization Perspective on Maximum LikelihoodEstimation
dev.toยท7hยท
Discuss: DEV
๐ŸŽ“Model Distillation
Flag this post
Adaptive Bias Mitigation via Synthetic Data Augmentation with Generative Adversarial Networks in Robotic Environments
dev.toยท8hยท
Discuss: DEV
๐ŸŽ๏ธTensorRT
Flag this post
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
paperium.netยท12hยท
Discuss: DEV
โšกFlash Attention
Flag this post
Evidence on language model consciousness
lesswrong.comยท15h
๐ŸŽ๏ธTensorRT
Flag this post
New Dataset PerSense-D Enables Model-Agnostic Dense Object Segmentation
hackernoon.comยท3d
๐ŸงฎcuDNN
Flag this post
Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
arxiv.orgยท1d
๐Ÿ”—Kernel Fusion
Flag this post
A Beginnerโ€™s Guide to Getting Started with add_messages Reducer in LangGraph
langcasts.comยท1dยท
Discuss: DEV
๐Ÿค–AI Coding Tools
Flag this post
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
paperium.netยท1dยท
Discuss: DEV
๐ŸŽ๏ธTensorRT
Flag this post
The Machine Learning Projects Employers Want to See
towardsdatascience.comยท1d
๐ŸŽ“Model Distillation
Flag this post